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Abstract 

Relay nodes-enhanced architectures are deemed a viable solution to enhance coverage and capacity of nowa¬ 
days cellular networks. Besides a number of desirable features, these architectures reduce the average distance 
between users and network nodes, thus allowing for battery savings for users transmitting on the uplink. In this 
paper, we investigate the extent of these savings, by optimizing relay nodes deployment in terms of uplink energy 
consumption per transmitted bit, while taking into account a minimum uplink average user delay that has to be 
guaranteed. A novel performance evaluation framework for uplink relay networks is first proposed to study this 
energy-delay trade-off. A simulated annealing is then run to find an optimized relay placement solution under a 
delay constraint; exterior penalty functions are used in order to deal with a difficult energy landscape, in particular 
when the constraint is tight. Finally, results show that relay nodes deployment consistently improve users uplink 
energy efficiency, under a wide range of traffic conditions and that relays are particularly efficient in non-uniform 
traffic scenarios. 


I. Introduction 

Heterogeneous network deployments in modern cellular networks are regarded as a promising solution to meet 
the ever-increasing demand of wireless data and voice traffic. They consist in installing a number of low-power 
nodes, possibly of different types (e.g., femtocells, Relay Nodes (RNs), etc.), inside the coverage area of macro 
base stations (also called eNodes-B (eNBs) in this paper), resulting in a more dense network architecture. Indeed, 
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a higher density of nodes entails a number of benefits, e.g., coverage and capacity are boosted ||T] and power 
efficiency improves Q, due to the reduced distance between User Equipments (UE) and serving nodes Q. This 
generates operational expenditures savings for operators, and lowers the environmental impact of their infrastructure. 
Results in 0,0 et. al. show that the introduction of RNs and picocells can reduce downlink power consumption. 
However, this applies to the uplink as well (see e.g. Q, Q), and the decrease in battery energy consumption for 
users can be consistent: 0 > 0 indicate that uplink transmit power has a strong impact on users overall energy 
demand, especially for high transmit powers. 

Hence, heterogeneous networks can be considered as an effective means to extend mobile users batteries duration, 
and several research works are dedicated to this topic. Authors of and 0 propose a game theory-based framework 
for the maximization of femtocells uplink energy efficiency, by means of users transmit powers tuning Q or radio 
resources assignment and power control parameters optimization 0. In fig, closed-form expressions of the Signal 
to Interference Ratio (SIR) and of the outage probability are derived and used to measure the impact of femtocells 
coverage and users density on energy efficiency. Both Q and 0 apply Quality of Service (QoS)-related constraints 
to the optimization problem, so as to avoid solutions with high energy efficiency but poor performance. Other works 
in the area of cellular networks, e.g. 1ID> address this issue by jointly optimizing power and user experienced QoS. 

The close interaction between energy consumption and performance im makes the problem of energy efficiency 
maximization in relay-enhanced networks different from that addressed in 0.0 et al. for the femtocells deployment 
case; this is because RNs communicate to their donor node through a wireless backhaul link which represents 
a well-known performance bottleneck. On the contrary, small cells and femto cells benefit from a wired backhaul, 
where the delay issue is less crucial. Surprisingly, uplink RN networks energy consumption has received limited 
attention in the literature, to the best of our knowledge. Reference GD proposes an optimization of the uplink power 
control in relay-based networks but ignores energy consumption. Authors of @, 03 treat the maximization of 
uplink energy efficiency via either optimal assignment of subcarriers, users powers and bit allocations 0, or optimal 
radio resources allocation OD- More than one order of magnitude can be achieved in user power consumption 03- 

A drawback of 0 is that the decisive impact of co-channel interference is neglected. Also, the paper assumes 
that users are fixed in number and position, and always have data to transmit. Hence, the influence of the traffic 
intensity is not considered, while a number of studies (see e.g. 03 . 03 ) show the importance of the traffic load 
for energy efficiency evaluation. Einally, 0, 03 lack of a thorough theoretical framework for the analysis of 
relay-enhanced cellular networks uplink energy efficiency, and they are based on the sole minimization of users 
energy consumption, without investigating the necessary tradeoff between this consumption and uplink performance. 
Hence, the need arises for a finer model to study this tradeoff. 

The contributions of this paper are the following. 

• We study the tradeoff between energy consumption and users experienced delay in uplink relays cellular 
networks. This is the first study of this type, to the best of our knowledge. We show that in many cases, relays 
can help both saving energy and reducing delays despite the constraint imposed by the wireless backhaul. We 
also highlight the interest of using relays in scenarios, where traffic is not uniform. 
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In order to find upper bounds on the achievable gains in terms of energy saving and/or average delay, we 
formulate a constrained optimization problem. Its objective is to place the relays and tune the network 
parameters so as to minimize the energy consumption per transmitted bit under delay constraints. To solve 
this problem, we rely on a Simulated Annealing (SA) algorithm enhanced with exterior search and penalty 
functions. We discuss the effectiveness of our approach for addressing our problem. 

• We propose a framework for the analysis of energy efficiency in uplink relay networks. In order to study the 
delay constraint, we consider the dynamic nature of cellular traffic, by means of a model of UE arrivals and 
departures. This results in a hierarchical flow level analysis. The loads of eNBs and RNs are accounted in 
the estimation of interference and transmission delays. Our propagation model includes shadowing and fast 
fading, and UEs power control is considered. Overall, the proposed model is more comprehensive, compared 
to the existing literature on uplink energy efficiency in heterogeneous networks. 

• We adopt a non-trivial scheduling scheme, which represents an approximation of the Proportional Eair (PE) 
scheduler, and derive the probability density function of the Signal to Interference plus Noise Ratio (SINR) of a 
scheduled UE. The choice of the scheduling algorithm has indeed a decisive impact on the delay performance. 
This is a novel contribution, to the best of our knowledge. 

The paper is organized as follows. Section introduces our system model, while Section III describes the 
framework used for evaluating energy consumption per transmitted bit and delay. Section |IV] is devoted to the 
optimization problem and our proposed algorithm, while Section]^ gives our numerical results. Einally, Section VI 
concludes our work. 


II. System Model 

We work on the uplink of a cellular network, where transmissions are performed on synchronized frames, and 
each network station (i.e. eNB or RN) is associated an uplink frame. Erames are partitioned into Radio Blocks 
(RBs) of the same bandwidth and time duration, which are labeled with an index r. Each RB r in the frame of 
station k can be either granted to one of the UEs served by k, or be left unused (if there is no UE to be served). 

The network is composed of a set JC of K stations, and divided into cells; let /Cs be the set of eNBs, and ICr 
the set of RNs, so that /C = /Cs U JCr. We focus on one given cell c containing one donor eNB GD and Urn 
RNs, which are connected to the eNB by means of a wireless backhaul link (Eig. [T]). The set of stations of cell c 
is denoted JCc- The surface Ac of cell c is dehned as the region where UEs are served by one of the stations of c, 
and its area is denoted Ac- Similarly, surface Ak, of area Ak, is the region where UEs are served by station k. 

UEs select their serving station based on the highest product of received downlink pilot power times a station 
specihc bias. They are then served over the access link. The technique of biasing the user association is referred 
to in literature as cell range expansion HD’ and it is deemed beneficial for network performance as it allows for 
load balancing. We assume that eNBs have a bias of 1 and RNs have a common bias B. 
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Figure 2: Frame structure. 


Figure 1: Uplink model. 


A. Frame Structure 

Access link and backhaul link transmissions are dedicated two orthogonal sections of the frame (Fig. [^. RN 
transmissions on the backhaul link, which uses a quota /3 of the RBs, do not interfere with UEs transmissions on 
the access link. This choice is widely adopted in the literature (see e.g. | fT3] , fir)), as it avoids interference between 
RNs and UEs on the uplink. Eor the same reason, we assume that /3 is the same for all network stations. The value 
of P is supposed to be set by the operator and based on considerations on the overall network performance, while 
our analysis is focused on the performance related to one given network cell, given /?. 

B. Trajfic Model 

We assume that UEs arrive in the network according to a spatial Poisson point process of intensity A(s) [arrivals/s/wP], 
transmit a flow of average size ^ \bits\ to their serving station, and leave the network. Elows transmitted from UEs 
to a RN on the access link are then forwarded by the RN to its donor eNB on the backhaul link. 

The traffic density u}{s) at a given location s is denoted by a;(s) = A(s)^ \bits/s/mP], while the average traffic 
density u) in the network can be computed as a) = (1/A„u,) \{s)^ds [pits/s/m?], where Anw is the overall 

network surface, of area We define (f){s) = uj{s)/oj,\/s G Anw, to account for the local variation of the 

traffic density with respect to a). The ratio (j){s)/Anw can be seen as the spatial Probability Density Eunction (PDE) 
of UEs arrivals. We also define (/)(s)(is for the sake of further developments. 

The typical number of bits carried by a RB is assumed to be much smaller than Hence, access link buffer 
queue of all stations k G JC can be modeled as an M/G/l/PS queue (Eig. |^, and its load is denoted with p/,. It is 
the sum of load contributions Pfe(s) over Ak, so that pk = Qk{s)ds. Only stable scenarios, i.e., pk < l,yk G K. 
are considered in this work. We define the vectors p = [pi • • • pk] and p_j. = [pi ■ ■ ■ Pk-i Pk+i ■ ■ ■ Pk], for 

*We assume large enough so that interference in cell c is accuratly computed. 
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UE UE UE UE UE UE 

Figure 3: Queuing model. 


the sake of further developments. Let introduce a binary random variable ©^(t), which is equal to one when RB 
r in station k frame is granted to a UE, and equal to zero otherwise. We will assume that the probability for any 
station k to receive data on RB t depends only on pk, and is equal to ET-[0fc(r)] = pk- 

Backhaul link buffer queue is also modeled as a multi-class M/G/l/PS (Fig. 0. Each flow in the backhaul queue 
belongs to a class k € {1 • • • njjjv}, according to the transmitting RN. The model shown in Fig. [^thus allows for 
a hierarchical flow level analysis. 
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C. Propagation Model 

Consider a UE transmitting on RB r, and located on s. We denote with T{s) its transmit power, and with Gk{s) 
the component of the channel gain towards station k due to distance dependent attenuation and shadowing. We 
assume Gk{s) constant on all RB^ The power Pk{s) received by k from the considered UE is thus obtained as: 
Pfc(s, r) = T{s)Gk{s)i'{s, t), where v{s, r) is the variable component due to the fast fading. We adopt here a block 
Rayleigh fading model 09), i.e., fast fading is constant on a RB, and fast fading realizations associated to any two 
distinct RBs or locations are independent of one another. The Eull Compensation Power Control (ECPC) scheme 
adopted by the LTE standard 0 is chosen to determine T{s). We thus have: T(s) = rninlTmaxi Ckis) ^ ^ 
where T„iax is the UE maximum transmit power and P is a target received power broadcast by k to all UEs. We 
assume that P has the same value P=PeNB for all eNBs, and P=Prn for all RNs, for the sake of simplicity. 


D. SINK model 


Consider a UE located in s and a RB t. We define the instantaneous SINR ^k{s,T) experienced by station k 
for the considered UE as: 


lk{s,T) 


_ Pk{s,T) _ 

'EjeKJ^k ^3MPkifjiT),T) + N' 


( 1 ) 


where fj{T)GAj yields the position of the UE scheduled by station j on RB r, and N is the thermal noise power. 

In the following, we drop the dependency of all variables on t, for the sake of simplicity. Let p^^kis, z),z > 0 be the 
PDE of 7 fe(s). Eollowing [ pO) , p7) , we approximate p-y^k{s, z) by a lognormal distribution, i.e., 7 j;{s)~Log—A/'(/r.y a^^k) 

(see App. I^for the derivation of p^^k and a-^ k)- 


E. Scheduling Policy 

Let assume that stations have perfect Channel State Information (CSI) about all served UEs and consider a given 
frame of station k. We label the UEs served by k during the considered frame with an index i = 1 ■ ■ ■ U, while 
their locations are denoted by si ■ ■ ■ su respectively. Every station implements a Maximum Quantile Scheduling 


|, 1231 (MQS) among served UEs. The MQS scheduler of station k assigns RB r to the UE i that maximizes 
its instantaneous SINR jk{si, r), with respect to jkisi, t — h),h = 1 over a window of W RBs (more 

details can be found in App. |^. This policy has the property of being fair in RBs allocation between UEs, while 
maintaining a good throughput performance . Moreover, 1231 shows that the performance degradation introduced 


by an imperfect estimation of the SINR distribution can be lower than that incurred in practical implementations of 
other popular scheduling algorithms. The MQS scheduling differs from the well-known PE scheduler p4) , because 
the latter allocates each radio resource to the user maximizing its scaled SINR 7 fc(si, T)/jkisi) on r, where yk{Si) 
represents the average SINR experienced by k for user i over the last W RBs. However, for large values of W its 
behavior approximates that of a PE scheduler (see e.g. I®). 


^The underlying assumption is that shadowing does not significantly change in time, for a fixed location. This is consistent with, e.g., the 
measurements in 1181. 
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Now, consider the distribution Tr,^ ^(sj, z)dz = P{z<ji:{si)<z+dz \ fk=Si), Si G Ak, z > 0 of 7 ^( 5 ^) conditioned 
to fk=Si, i.e., the PDF of the SINR of a scheduled UE. Contrary to p^^k{si,z), this distribution is not lognormal. 


Theorem 1. The distribution 'K^^k{si,z) can be expressed as: 

w 

TTj,k{s^, z) = Pj,k{Si, z) ^ Qn{i, z)Tk(W, n), 

n—1 

where 




TkiW,n) = 


W-1 

n—1 


pz \ W—n 

J p^^k{si,t)dtj X 

1 - / p^,k{si,t)dt 
Jo J 


W\l-pk) 

(w~-pk{W-n)) 


2 ■ 


Proof: See App. 


( 2 ) 


F. UEs Physical Data Rate 

We dehne the function < 7 ( 7 ) > 0, yielding the physical data rate achieved, when the receiver experiences SINR 
7 (or equivalently the throughput of a user, if it were alone in the serving area of its serving station). We assume 
that ( 7 ( 7 ) is non decreasing in 7 . 


III. Performance Evaluation 

In this section, we dehne and derive our performance parameters in terms of delay and energy consumption per 
transmitted bit. We hrst obtain the expressions of access and backhaul link loads, and then we use them to get How 
transmission delays. 


A. Access Link Load 

Let focus on station k, and recall that gk{s), s G Ak is the contribution of s to the load pk of station k. We have 
the following Lemma; 


Lemma 1. The local contribution Qk{s), s G Ak to the load of the access link buffer queue of station k is expressed 
by: 

ek{s) = Y 


jfis) 


C{'lkis,T)) 


fkir) = s 


(3) 


Proof: The contribution to the load is expressed as the ratio of the traffic density ui(j){s) in s to the uplink rate 
achieved by a scheduled UE in s. The term (1 — /3) takes into account the quota of the frame RBs dedicated to the 
backhaul link. ■ 
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Theorem 2. The load pk of the access link of station k can be expressed as: 


Pk = 


OJ 


1-/3 


Ak 


Hs) 


C{z) 


dzds. 


(4) 


Proof: The load pk is the sum of the contributions Qk{s) over Ak'- Pk = Qk{s)ds. Now, the term 
ET-[ 1 /C'( 7 fc(s, T))|/fc(T) = s] in (j^ can be expressed according the law of total probability, obtaining (|^. ■ 

Note first that if all UEs transmit at power T(s) < TmaxiVs G Ak, then 7r^^k{s,z) = Tr^^k{z), Vs S Ak and 


equation (4i reduces to; pk = 


P “-7 


C(l) 


. can thus be seen as the effective area covered by k, in terms of 


traffic density. 

Note then that p^ is a function of p_]^, as the load depends on the interference from other stations. We define 
the operator Fk{p_}f) = w/(l — /3) (j){s) 7rj^k(s, z)/C(z)dzds, which yields the value of pk corresponding 

to a given p_^. 


Lemma 2. The operator F[p) = (E'i(p_i) • • ■ Fk{P-k)) tnaps p to a finite K-dimensional interval. 


Proof: For all stations k G 1C, the corresponding Fk{p_ff) is increasing with respect to every coordinate, as 
an increase in the load of any station j k,j G JC produces an increase in the interference received at k. We 
define yk,pff^^ = Ffc(l • • • 1), which represents an upper bound for the load of k. Then, yk,ypk,Fk{p_f,) = 
J^fc(max{l,p_j.}) < Ffc(l • • • 1) = Hence, 

F{p) G n [o,pn, pr^ e R+, yk G 1C. (5) 

keK 


Theorem 3. There exists at least one p* such that F{p*) = p*, p* G 

Proof: This follows from Lemma and •S': if f is a continuous mapping from an iV-dimensional closed 
interval to itself, then the Brouwer’s fixed point theorem guarantees the existence of at least one fixed point. ■ 
It is not feasible in general to draw any conclusion about the uniqueness of the fixed point. However, following 
the approach of p5| , we start from a single cell without interference {p = 0) and iterate function F, so as to model 
a scenario of increasing traffic. 


B. Backhaul Link Load 


We now focus on the backhaul link queue of the eNB of cell c. Let j G ICc be the index of the eNB and PBL,j 
the load of the queue. The probability for a given RN k served by j to be selected for backhaul transmission is 
equal to ^k/ Ahk^K where <1)^ = (j>{s)ds. Hence, pblj can be written as: 


Pbl 


- riRN ^ 


LU 
= o 


^ RsLJ.k ’ 


( 6 ) 
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where the inverse of RBL,j,k is the average of the inverse of the rate on the backhaul link between RN k and eNB 
j, which depends on propagation conditions, and is equal to: RBL,j,k = - fc(T)) ]’ '''here ^BL,j,k{T) 

denotes the SINR on the backhaul of RN k served by eNB j, during RB t. A detailed derivation of RBL,j,k is 
presented in App. 


C. Flow Average Transmission Delay 

The flow average transmission delay is an effective parameter to measure network uplink performance. It is the 
sum of the average access delay and the average backhaul delay. In this section, we derive its formulation. Consider 
a user located in s, served by RN k with donor eNB j. We denote by Dk{s) the average access delay of a flow in 
s, by Dk the average delay of a flow served by k and by DBL,j,k the average delay on the backhaul link. 


Lemma 3. The average access delay in s is expressed by: 


Dk{s) = 




'T^'y,k{s,z) 


■dz. 


(7) 


1 - Pfe 7o (1 - l3)C{z) 

Proof: The delay to transmit a bit of information is the inverse of the UE rate, multiplied by the number Xk 
of UEs served by the same station during a frame (because each UE is scheduled on a fraction \/xk of the RBs with 
MQS). The average transmission delay for the whole flow is hence given by: Dk{s) = ^Et- 




_(l-/3)C(7fc(s,T)) 

Jk{T) — S 


Considering that P{xk = U\xk > 0) = ^(1 — pk), the law of total probability can be used to average Dk{s) 

°° f I n 

over Xk > 0: Dk{s) = 'V' P{xk = U)- -—?— ^IfkA) = s], obtaining expression (7 1 . ■ 

1 -^ Chk{s,T)) 

Corollary 1. The average access delay for a UE served by k G JCb. 


Dk = 


fis) 


lAk 




Dk[s)ds. 


( 8 ) 


Proof: The statement can be verified by means of the Little’s law: Dk = Now, we can 


i-Pk ■ 


substitute Q to the numerator and conclude the proof 


Lemma 4. The average backhaul link delay is expressed by: 


DBL,j,k = 


, k e /C_R, j S ICb- 


(9) 


(1 — PBL,j)ldRBL,j,k ’ 

Proof: Similarly to what performed for Lemmaj^ we proceed by expressing ZJsi j fe as: DBL,j,k = 'p]i^^~i^^r[xBL,j{T)ABL,j{T 


0 ] and apply the law of total probability, obtaining: 


DBL,3,k = -^P(a:BLj(T)=u|a:BLj(T)>0) X 

P^BL,3,k ^ 


Er [XBL,j{T)\XBL,3{T) = v] 


where xbl.j is the number of flows in the backhaul queue of j during the considered frame. Conventionally, we 
set Dbljj = 0- From this equation, we obtain ■ 
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Proposition 1. The average transmission delay for an uplink flow transmitted in cell c is equal to: 


$ 


^ -J^{Dk + DBL,j,k)- 


k^Kc 


( 10 ) 


D. Energy Consumption per Transmitted Bit 

Let e(s) be the energy consumed by a LFE located in s S Ak for transmitting one bit. This metric is sometimes 
called Energy Consumption Rating in the literature |26), 1^. We have; 

T(s) 


t(s) = E 


C(7fe(s,'r)) 

Jo C{-fk[s,z)) 


fkir) = s 

S € Ak 


( 11 ) 


which can be also expressed as a function of Dk{s): e{s) = T{s)Dk{s){l — Pfc){l — fl),s G Ak- The average 
uplink energy consumption per bit If associated to UEs in cell c is given by: 


n = 


^ Ia 

1-/3 


Ar. 


X! / T{s)Dk{s)(l){s)ds. 

kGKa 


( 12 ) 


IV. Optimization 

In this Section, we discuss the optimization of UEs uplink energy efficiency. The proposed algorithm aims 


at minimizing the UE average energy consumption per transmitted bit in (12i, by optimizing P, B and the RN 
placement in the cell of interest, for a given ui, ubn and traffic spatial profile (j){s). Our optimization is constrained 


to respect a maximum tolerable value of Dc in (10 1 , so as to take into account the users experienced quality of 
service. 

RN placement is usually optimized with respect to the downlink performance of the network because of its larger 
traffic volume. On the contrary, we consider here the uplink performance because we aim at finding upper bounds 
on the possible gains that an operator can achieve in terms of energy and/or average delay on this link. Note 
however that with the growing traffic related to multimedia content sharing, social networks and other peer-to-peer 
applications, uplink and downlink traffics tend to be more balancec|^ 


A. Problem Statement 

We assume that RNs can be placed on a discrete and finite grid of candidate sites inside cell while the 
positions of RNs outside cell c are assumed to be already set by the network operator, and not modifiable during 
the optimization. Position of a given RN i is denoted with z{i). Moreover, we define P = [PeNB Prn]- Target 

joint uplink and downlink RN placement optimization is left for further work. 

^This assumption is consistent with real deployment scenarios, where normally only a certain number of locations inside the cell are 
available for RN installation |28| . 
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received powers PeNB and Pam can vary between a minimum and a maximum value, denoted with Pmin and Pmax 
respectively. Similarly, we assume ySmin ^ ^ Pmax (see Section 0. Now, the configuration x of the network is 

defined as the set of positions of the npusf RNs in cell c, plus the adopted P and B: 

X : - • ,z{nRN),P,B} , 

z{i) G A, Vi e {1 • • - Urn}, 

P ■ < P < P B ■ < B < B tni 

mm _ maxj ^min _ ^ _ ^max- 


We name configuration space the set of all configurations, and denote it with fl. 

Our problem is to minimize energy consumption ( [T2) i under delay constraint ( [T0| i: 

minn(a;) 

X 

s.t. Dc{x) < X G G 1R+, 


where constraint (15 i restricts the domain of feasible solutions to the subspace: 


0 = {x G 0 s.t. Dc{x) < £)„„} 


(14) 

(15) 

(16) 


Recall that the computation of the station loads via the fixed point iteration of Section III-A is required for the 
evaluation of the delay. 

Now, the problem ( [l4| ) is in general non-convex, and the cardinality of fl, especially for high urn, makes it 
intractable via exhaustive search. Hence, we rely on stochastic optimization algorithms, and propose a customized 


version of the well-known Simulated Annealing algorithm to solve (14 1 . In the following, we first briefly recall the 
generic SA, and then introduce our version. 

1) Generic SA Algorithm: The SA is a metaheuristic aimed at solving large non-convex problems, which has 
been first proposed by Metropolis | [29) and then applied on a wide range of optimization problems (see, e.g., H), 
0 ). The literature shows that the SA is an effective algorithm, if its parameters are appropriately tuned (see, e.g., 
j^). Let J-{x) denote the associated with configuration x, and consider the problem of minimizing J-{x) 

over the configuration space fl. The SA explores only a subset T* C H of the configuration space, where usually 
|T'| ^ |H|, and is able to And the optimal configuration by means of an appropriate selection of the analyzed 
configurations. At temperature T, the algorithm proceeds by assigning to each configuration x an exponential 
probability, given by: 

exp(-^^) 


Pt(x)= 


Zj' 


, Va? € fi, 


(17) 


where Zt is a normalizing constant. Hence, the solution is the x that maximizes According to the Metropolis- 

Hastings variant ||33| of the SA, the set ^ of configurations to be analyzed is determined according to the following 


^Not to be confused with the energy in J involved in the energy consumption 


ion jl2| . 
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procedure; 

• At step TO = 0 a configuration £c(0) G is arbitrarily selected and designated as current solution a;*(0) for 
step 0; a;*(0) ^ x(0) G 

• At any step to > 0: assume at* (to — 1) is the current solution for step to — 1. The SA will hrst draw a new 
candidate solution x'{m) G for step to, which complies with the given proposal law r(x*{m — l) —t x'{rn)) 
(defined by the algorithm designer). Then, assuming that r is symmetric, x'im) will be accepted as current 
solution for step to with probability: 


PT^{x*{m) ^ x'im)) = 

. / / F ix'Im)) — F ix* im — 1)) 

mm 1, exp- 


where Tm is a parameter called temperature, which decreases to zero slowly enough as to —>■ +oo, and is such 
that Tm > ■ If x'{m) is not accepted, then a:*( to) •(— a;* (to — 1). In most practical implementations 

of the SA, the temperature is updated following a law of the kind = TqH^, h < 1, where h is close to 1. 

2) Proposed Exterior Search Approach for SA: We seek for a minimizer of n(a:) over (l, where the cardinality 
of the feasible conhgurations space depends on the value of the constraint: |ri| = /(Dmax)- In particular, / is an 
increasing function of Z^max, because the number of feasible configurations increases as the constraint loosens. 

When I?niax is small, we can reasonably expect to have |1Z| ^ |n|. This may reduce the effectiveness of the SA, 
as the energy surface that the algorithm explores could be fragmented into many isolated regions, some of which 
are unreachable for the algorithm. Moreover, we can expect the optimal solution to tightly respect the constraint, 
i.e., to lie close to the border of Cl Thus, the standard SA may not appropriately cover the region where the 
solution is, as many of its configurations are non feasible. 

This problem can be solved by means of an exterior search approach p5j , p^ . It consists in extending the 
search to configurations outside Cl, while adding a penalty to the energy of those conhgurations that violate the 
constraint. This method presents analogies with the Lagrangian relaxation method (see the classic works 0,|38)), 
and it is regarded as a powerful instrument to deal with constrained optimization problems (see, e.g., p9) , pO)). 
This idea is illustrated in Fig. where we see on a hctitious example how exterior search can reduce the path to 
the optimal conhguration. 

We thus adopt an exterior search approach, and reformulate ([l4|) as: 


min J^(a;), £C G n, (18) 

X 


where F{x) = n(a;) + 


and where V(x) is a function fl i-G [R accounting for constraint (15 i and named penalty function, which plays a 
crucial role in the optimization 


Penalty functions are classihed into static, dynamic, and adaptive. A static penalty function does not change 
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Figure 4; Exterior search principle: the search from the initial configuration Xq to the optimal configuration x 
is extended to configurations outside 17 (in red: interior penalty, in blue: exterior penalty). 


during the optimization. It can be represented, e.g., by a fixed constant added to the energy of those configurations 
a; ^ 17, or by a function proportional to the Euclidean distance of the considered configuration to the feasible region 
ED- On the contrary, dynamic penalty functions can be adjusted in accordance to the progress of the optimization. 
A common approach is to gradually increase the penalty with the number of explored configurations | [42) , so as 
to guarantee the convergence of the optimization towards a feasible solution. Einally, an adaptive penalty function 
(see, e.g., | |40) , ED) considers further aspects of the search, such as steering the algorithm towards regions of the 
energy surface which are deemed promising for the search. 

We propose the following exterior penalty function, which is dynamic and adaptive at the same time: 


V{x^ 77max) — (y.{jTl 1) 


n(3:) 




(19) 


where a is a constant and m is the SA step. The benefits of using function are manifold. Eirst, the penalty is 
proportional to the violation of constraint i7„iax, favoring the exploration of configurations which are out of 17 but 
close to it, while penalizing more those which are far. At the same time the penalty is independent of the adopted 
value of I7max, depending rather on the percentage of exceeding delay, with respect to Z7max- This represents an 
important feature when dealing with constraints of a different order of magnitude, compared to n(a;). Moreover, 
the penalty is proportional to n(a;), so as to favor the exploration of configurations outside 17 when they appear to 
be promising, i.e., their energy is sensibly lower to that of the current solution (hence the adaptive nature of ([T^). 
Einally, acting on a we can modify the percentage of acceptance for configurations outside 17. We use here the 
works of Geman and Robini |44|, ED' Choosing a{m— 1) > log(TO) is sufficient to guarantee convergence of the 


SA algorithm towards a feasible minimizer. App. [P] explains why the convergence property of the SA is maintained 
with such a penalty function. 

Surprisingly, stochastic constrained optimization based on exterior penalties has been seldom employed in wireless 
networks optimization. There are however three recent interesting references using this method ED~ED- this 
context, two important problems arise: 

1) The generic choice of the penalty term itself. To our knowledge, the most interesting analysis appears in 
BSl, using the so-called logarithmic barrier. Our approach is somewhat similar: since our penalty term is the 
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product of the objective function n(a;) and the constraint D(.{x) — its gradient is a linear combination 
of the gradients of both previous expressions. Thus, in a deterministic, continuous framework, during a phase 
when the penalty is active (Dc{x) > D^.„) it adapts the minimizing direction search to both the objective 
function and the constraint opposite gradients (see Fig. [^. We expect the stochastic, discrete SA to behave 
in a somewhat similar way. 

2) The selection of the Lagrangian multipliers and their evolution during the algorithm. Apart from the usual 
choice of constant parameters, an interesting development can be found in ph) : at each step of the SA 
algorithm, each penalty weight itself either increases if the associated constraint is violated, or decreases (in a 


geometric way) if this constraint has been satisfied during several previous steps. In |47| all penalty weights 
increase regularly. 

To the best of our knowledge, our approach is the first one to use stochastic constrained optimization for station 
placement in wireless networks. The originality of our work also lies in the multiplicative penalty function and the 
theoretical choice of the penalty coefficient a{m — 1). 

3) Initial Temperature Choice: Following gg, we address the problem of finding a good value for the initial 
temperature Tg in each considered optimization problem via dichotomic search during a series of preliminary runs 
of the algorithm. 


V. Results 


A. Considered Scenario and SA Parameters 


The proposed theoretical framework is here applied on a test scenario drawn according to |12 case 1] for 
what concerns propagation, shadowing and stations transmit power. System bandwidth is 10 MHz. The network is 
composed by one central eNB surrounded by 6 eNBs regularly distributed around it on a circle of ray 500 m. We 
optimize RN placements in the central cell only, while RNs in the surrounding cells are assumed to be regularly 
distributed on a circle of radius 160 m around their donor eNB. All stations have omnidirectional antennas. The 
same realization of shadowing (drawn according to GD) is used for all simulations, so as to ensure that results be 
comparable. If not otherwise mentioned, we adopt a uniform traffic density spatial distribution, i.e., is constant 


VsGv4„u,, in order to draw general conclusions about energy efficiency. In Section V-B5 we show the performance 
under a non-uniform traffic distribution. We set 13=0.1. 

Function C is approximated by means of the Modulation and Coding Scheme (MCS) indicated in ||50), so as to 


take into account the effect of an upper-bounded capacity function, which is the typical case in real deployments. 
Also, fast fading on the backhaul link is not considered. This choice is justified by the assumption that RNs do 
not move, and optimization of their positions is performed on a long-term basis (see, e.g. my We have assumed 
‘Bmin=0 and ;Bniax=15 dB, with a step of 1 dB. The grid of candidate RN sites has a step of 50 m. 

The fixed point of F{p) is found by iteratively computing stations loads. At iteration t, probabilities Tr-^^k{z,s) 
are computed according to — obtained at iteration t—1, and then fed into Q to get p{t)=F{p{t—l)), starting 
from p(0) = 0. Iterations are stopped when either \pk{t) — pk{t — l)\<Q.01,yk G 1C, or 3k : pk{t) > l,k G JC.\t\ the 
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latter case, the analyzed configuration is labeled as non-valid. Similarly, fixed point iterations are used to compute 
the loads on the backhaul link, after p is determined. We were not able to analytically prove the convergence of 
the fixed point iteration (by showing for example that is a contraction mapping). We have however numerically 
observed that the iteration always converges in less than 10 iterations. 


SA algorithm is implemented as described in Section IV Once again, we emphasize the need to optimize 
network parameters and in particular RN locations in order to obtain upper bounds on the achievable performance. 
The algorithm yields a solution after 45 temperature steps. For each tested temperature value, the energy of a certain 
number of network configurations is calculated. This number varies according to urm, e.g., for nuM = 2, 400 
network configurations are tested at each temperature step. For every optimization, the SA is repeated 4 times, and 
the best solution among the 4 obtained solutions is elected as a final result. The configuration corresponding to the 
final result is denoted with a;*. 


B. Numerical Results 

We denote with the configuration with minimum energy when no RNs are deployed in the whole network, 
and with no(a:g) its corresponding energy. Unless otherwise mentioned, results are expressed in terms of the energy 
consumption ratio n(a;*)/no(a:5)’ so as to show the energy gain (or loss) resulting from RN deployment, and they 
are plotted against the normalized constraint D^nax/D q, where Dq is the value of Dc corresponding to Xq, i.e., the 
average delay when no RNs are deployed. Hence, a point in the region n(a;*)/no(a;Q)<l corresponds to an energy 
consumption per bit gain with respect to the network with no RNs, whereas a point in the region Z3„iax/F?o<l 
corresponds to an average uplink transmission delay gain. All the curves that we obtained exhibit a hyperbolic-like 
shape, which follows from the nature of our constrained optimization. When I?max is large, |r2| « |H|, the delay 
doesn’t play any role in the optimization and so the energy consumption gain reaches its maximum. Instead, if the 
constraint is tight we have |H| ^ |U|, and it is unlikely that any configuration that performs well in terms of energy 
efficiency lies in U. 

1) Energy - Delay Trade-off: Fig. shows the trade-off between energy consumption and delay, for a varying 

number of RNs. Note how RN deployment can bring consistent UEs energy savings. Let consider the region where 
D-aiax/Do>l. We observe that the energy consumption gain increases with This is an expected result, as 

RNs reduce the average distance between UEs and their serving station. The corresponding increase in uplink 
interference is mitigated by a lower users transmit power. On the contrary, a lower number of RNs performs better 
when Djnax/Do < 0.5. This behavior is related to the cardinality |U| of the valid configurations set, which tends 
to decrease with I?max/E* 0 j narrowing the space for network optimization and reducing the achievable energy 
consumption gains. In this case adding RNs is detrimental for energy efficiency, as more UEs served by RNs imply 
a higher backhaul delay, that further reduces |U|. 

2) Effect of Offered Traffic: We analyze the effect of (D on energy consumption in Eig. where n(a;*) and 
Fornax are scaled with the constants no(a;g(w = 5)) and E)g(d) = 5) resp., obtained in a network with no RN and 
a traffic density w = 5 bits/s/m^. This choice has been made in order to keep the same scaling constants for all 
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Figure 5: Energy consumption ratio n(a:*)/no(a;5) 
vs delay -Dmax/£*o (w = 5 bits/s/m^). 



Figure 6: Energy consumption ratio 
n(a;*)/no(a;^(w = 5)) vs delay D^^^/Do{u) = 5) 
for varying Q inuN = !)■ 


curves. As we can see, the deployment of 1 RN is sufficient to reduce the average energy consumption of UEs 
to less than a half, compared to the eNBs-only network. Now, let focus on the line Dq{u} = 5)=1 (same 

delay as in the eNB-only case). We note that adding a single RN can help the terminals to be more energy efficient 
without touching the quality of service and even if the load increases to = 20 bits/s/m^. Nonetheless, beyond 
approximately w = 30 bits/s/m^, delay and interference negatively affect user performance. 

3) Energy Efficiency of RNs vs Small Cells: Fig. |7] compares the results obtained with our system model, with 
those obtained using small cells instead of RNs (i.e., (3 = 0, DBL,j,k = 0, Vj G Aiis, VA: G ICr), highlighting the 
difference of our work with respect to those dedicated to devices with wired (or ideal) backhaul. As expected, 
RNs allow for a smaller energy consumption gain compared to small cells, for a fixed Umax- This is due to the 
transmission delay on the backhaul link, to the increased delay on the access link (due to /3 < 1) and to the 
constraints on RN placement related to backhaul path-loss and shadowing. Performance of RNs is more impaired 
when the traffic density or 71^^? increase. However, RN deployment still yields consistent uplink energy consumption 
gains. 

4) Effectiveness of Exterior Search with Penalty Function: Fig. compares the results of the optimization using 
penalty function and exterior search, with those obtained by means of an interior search. Both the interior and 
the exterior search have been carried on with the same number of iterations, for an unbiased comparison. The 
exterior search proves to be more effective when the delay constraint is tight, as (l is expected to be fragmented 
and |H| <C |H|. No meaningful gain in terms of search effectiveness is observed when the constraint is loose, as in 
this case the two approaches tend to coincide. 

5) Effect of Offered Traffic Spatial Distribution: Fig. compares the results obtained using a uniform (j){s) with 
those obtained when (j){s) is the sum of a uniform traffic profile and a bi-dimensional gaussian function, centered 
at s : {a; = 115; y = 143} m, with the same average traffic density. We can see that RNs allow for larger energy 
consumption gains when traffic is non-uniform. This is due to the flexibility of the RN solution that allows relays 
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Figure 7: Comparison between wireless RNs and 
wired small cells, for varying w and nuN iP = 0 . 1 ). 



Figure 8 : Comparison between exterior and interior 
search approach effectiveness (w = 5 bits/s/m^, = 
0.1, rifljTv = !)■ 


to be placed close to the hot-spot center. 


VI. Conclusion 

We have proposed a comprehensive framework for the optimization and performance evaluation of uplink energy 
consumption per bit in relays-enhanced cellular networks. This framework considers the arrival and departure of 
users, and the loads of network stations. Moreover, shadowing and fast fading are both taken into account in the 
propagation model. A realistic radio resource scheduling scheme is assumed, and its impact on users performance 
is derived. A customized optimization algorithm based on exterior search with penalty functions is proposed for the 
optimization, which is carried on under quality of service constraints. Results show a meaningful boosting of users 
energy efficiency given by the deployment of RNs, even if the average flow delay is imposed to be the same as in 
the network without relays. Proposed exterior search approach is shown to be more effective than the traditional 
interior search. 


Appendix A 
Derivation of and 

We use here a simple and classical approach consisting in introducing a auxiliary Random Variable (RV). 0 

can indeed be rewritten as: 7 fc(s) = -— ^e^Pu(f')x+Nx ~ Pk{s)xRk, where cc is a realization of a lognormal 

RV AT, and is the denominator. Note that Rk does not depend on s. Several authors have shown that the 
interference term R = can be well approximated by a lognormal distribution Log— 7 ^, crf^) 

even when the number of interferers is variable (see e.g. 0 ). In | |20) , it is also shown that the product 

of an exponential RV (Pfe(s)) and a lognormal RV (X) as well as the sum of two lognormal RVs can be again 
approximated by a lognormal RV. Parameters cr-y./c) are then obtained by moment matching. The derivation 
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Figure 9: Energy consumption vs delay tradeoff; w = 0.1, /3 = 0.1, comparison of uniform and non-uniform 
traffic profile (j){s) for varying n^jv- 


of which is less standard, is now detailed. The mean Mij^ of Ik = is equal to; 


Mij, = 


0 


[PkU,)] 

Qj(s) 


3^k • 


U, Pj 


T{s)Gk{s)ds 


Pj^j,k^ 

j^k 


where Yj^k = ^^^T{s)Gk{s)ds. (1) is obtained by weighting the power received from location s with the 

local load in s. Note that T{s)Gk{s) are input parameters of the considered deployment scenario (propagation and 
transmit power assumptions). Now gj{s) depends on the SINR distribution in s and hence on the first moment Mi 
of Ik- In order to avoid an additional complexity to our model, we make the approximation Qj{s)/pj « (/)(s)/<i>j. 
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This is justified by the fact that in a urban environment, it is unlikely that UEs transmit at their maximum power 1131 
so that every user of j is received with the same average power. The approximation comes then from the expressions 
ss of gj{s) and pj. In the same way, we derive the variance M 2 p^. of Ik- 

where Hj ^^^T^(s)G'^(s)ds. Again, we approximate g{s)/pj « (/)(s)/$j. Finally, and aj^ are found 
by matching Mij^ and M 2 j^ with the mean and variance of the approximating lognormal. 


Appendix B 

Proof of Theorem[^ Derivation of ir^^kis, z) under MQS scheduling 
The MQS scheduler orders the values of SINR 7 fc(s, t — j),j G {0 • • • VP — 1} of each UE in ascending order. 
The ranking Tiir) of UE i located in Si on RB r is the ranking of ‘jk{si, t) in the ordered vector 'yk{si,T — j), j G 
{0 ... VP — 1}. The lower the ranking, the better the SINR on r (wrt the SINR on previous blocks). Station k assigns 
RB T to the UE with the lowest ri{T), % G {1 • • • C/}. We assume VU large enough so that the probability for two 
or more UEs to have the same ranking is negligible. We have; P(ri(T)=n)=l/VU, Vn G {1 • • • VP}|^ Moreover, the 
probability for any UE served by k to be scheduled is P(/fe(r)=Si) = 1/C/, Vz G {1 • • • C/} (due to the fairness in 
RBs allocation). Let 11.^ ^(si, z) = p^^k{si,t)dt denote the Cumulative Distribution Function (CDF) of jk{si, t). 
We have: 


P(ri(r)=n | 'yk{si,T)=z) = 

n7.fe(s*, z)Y' 


( 20 ) 


We denote Qnih z) the right hand side, which does not depend on U. Now, we derive tt.^ ^.(s, z), given U. Applying 
the Bayes theorem on tt.^ fc(s,z) we obtain: 


7r7,fc(s,2;) 


P{fk{T)=Si I 'yk{Si,T)=z)pj^kiSi,z) 
P{fkiT)=Si) 

p ifk{r)=Si I -ik{Si,T)=z)p^^k{si, z)u. 


( 21 ) 


We then work on P {fk{T)=Si \ 'yk{si,T)=z) applying the law of total probability, conditioning it with respect to 
the possible rankings r^jr): 


P(//c(T)=Si I Jk{Si,T)=z) = 

W 

^P(ri(T)=n I 7fc(si,T)=2) X 
n—1 

P{fkiT)=Si I 'ykisi,T)=z,ri{T)=n). (22) 

The probability of being scheduled depends on the instantaneous SINR only through the ranking: once we know 
rj(r), knowing 7 ^( 5 ^, r) does not add any additional information regarding the probability of being scheduled by 

^as in our block Rayleigh fading environment each realization of 'yk{si,T) is independent of the others. 
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k. Hence, 


Pifk{T)=Si I 'yk{si,T)=z,ri{T)=n) = 


p Uk{T)=Si I ri{T)=n) = 


fW 


\ W 


u-i 


and ([2T| becomes 


w 




n—l 


W-' 

w 


u-i 


(23) 


(24) 


So far, we have found 7r,y ^(s, z) given U. We now define = {W — n)lW, and use the law of total probability, 
summing up 'k^^].{^s,z) given U for all the possible values of U, and obtaining its general expression: 


IP(7fe(Sj,T) = Z I fk(T)=Si,U > 0) = 


w 


U =1 


where 


P7,k(Si,z)J2Qn(h^)(l-Pk)J2/^k ^ 

n—l 

oo 

(1 - pk) ^ = 


W^(l-Pk) 


(7=1 


(W-pk(W-n))^ 


(25) 


(26) 


We name Tk(W, n) = W^{1 — pk )/ (W — pk(W — n)) and get the conclusion. 


Appendix C 

Backhaul Rate Derivation 

We start from the definition of RBL,j,k as RBL,j,k = ^/^t[ c('-ibl ■ fc(T)) ]’ where jBL,j,k{T) is given by: 


lBL,j,k{T) 


PBL,j{sRN,j,k) 

T,hGJCB,h^J ^BL,h(T)PBL,j(fBL,h(T)) +N’ 


( 21 ) 


where Qblm is a binary RV indicating whether the backhaul of h is active on r, i.e., P{QBL,h = 1) = PBL,h\ 
PBL,j(s) is the power received by eNB j from a RN located in s; siiN,j,k indicates the location of RN k served 
by eNB j and fBL,h{T) yields the location of the RN scheduled by h for backhaul transmission on r. 

During each RB r a RN k in cell j can be scheduled for uplink transmission with probability ps{j,k) = 
PBL,],k ^ where PBL,j,t = {iP^t/{/3RBL,j,t))- Let T = \ICb\ denote the total number of cells in our network, and 
€ t^BiijiT) S {0- • -n/jAf} denote the index of the RN scheduled on the backhaul of eNB j on RB r, 
where ijir) — 0 means that no RN has been scheduled for transmission. Now, assuming that backhaul scheduling 
decisions in a given cell do not depend on those taken in other cells, we have that the probability of having a given 
set • • • iriT)} of scheduled RNs on RB r is equal to 


P(/bL,i(t) = SitJV.t.ii, • • • , /bL.t(t) = SBAT.T.zt) = 

Y[Ps{jPj{T)). (28) 

j 
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There are (n/j 7 v + 1)^ possible scheduling combinations in all cells. We assign an index (5, <5 = {1 • • • (ti/jat + 1)^} 
to each combination, and denote V((5) = PsOi Finally, we apply the law of total probability to obtain 


RBL,j,k = 


(29) 


where we assume PBL,j{sRN,h,o) = ((■ 


Appendix D 

Some properties oe Gibbs distributions for penalized energies 

The purpose of this Appendix is to give a hint to the ’’good” convergence of a SA process when an increasing 
exterior penalty such as ( [T9| ) is added in the global energy, coupled with an adequate SA temperature scheme. 
Consider such an augmented energy: 


J-{x) = U{x) + fj, ^{x), 

-^max 

with for instance /i = = a{m — 1) > 0. We say that $ is a penalty if the set of its minimizers 

Q* = { X € a, s.t. $(a:) = $* = min$(y) } 

yen 


(30) 

(31) 

(32) 


is exactly the feasible subspace fl (this holds for (19l). Now let us in general define the following set of ’’iso- 
constrained” subspaces: e€lRH'n£ = {a;Gn s.t. <i)(a:) = e}, and consider the exponential distribution given in 
(17 1 (see also ||44)). For any value of e such that fig ^ 0, one has: 

P{X = y) exp-P{y) 


Vtc, y G 


P{X = x) exp— 

exp - [ U{y)+y <^{y) ] 
exp — [ U{x) + fi $(a;) ] 
exp-17(y) 


exp —U{x) 


So one can safely write in such an iso-constrained subspace: 


P{X = x \x Gn^) = 


exp —U{x) 


Eyea exp-C/(y)' 


(33) 


(34) 


which is also an exponential distribution. 


Two key points are now at stake in view of SA purposes |44|, |45|: 


• First, as /r —> -foo (/i is similar to an inverse temperature associated to the constraint $(a;)), the global 
distribution ([T7| becomes concentrated on the subspace fl* defined in (32i while keeping the exponential 
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form (34i. The proof is classical; one can re-write G3 


as 


P{X = x) = 


exp — [ U(x) + ^ <I)(a;) ] 

Eyen exp - [ (y) + ^ $(y) ] 

exp — [ U{x) + ^ ($(ic) — $*) ] 


Ey^n exp - [ (y) + ($(y) - $*) ] 

Now, either <i)(a;) > $* and exp(— ^ — *&*)) —>■ 0 when ^ —>■ +c», or $(a;) = $* and in this case, 

exp(— fi (<i>(a;) — $*)) = 1 Vy. 

Then, consider distribution ( [T7] i with a temperature parameter T > 0. It can be written as: 

U{x) + fj, <i)(a;) 


P^{X = x) = 


exp-[ 


T 


EyGnexp-[ 


uiy)+f ^’(y) 

T 


exp-[ ^ ($( 0 .) - $*) ] 

S^GO exp -[ + I (ci>(y) - eh*) 


where U* is the minimum of value of objective function U{.) on fl*. Now, if both T —> 0+ and y — 
+00 s.t. ^ ~ log m (new inverse temperature), a similar analysis as before, now in two steps shows 
that for the penalty case where 17* = 17, the distribution of interest Pt( ) concentrates on those configurations 
with minimal energy U* on feasible subspace 17 (see rigorous proof in |j^, |j^). 
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